VocalBench-DF: A Benchmark for Evaluating Speech LLM Robustness to Disfluency
Liu, Hongcheng, Hou, Yixuan, Liu, Heyang, Wang, Yuhao, Wang, Yanfeng, Wang, Yu
While Speech Large Language Models (Speech-LLMs) show strong performance in many applications, their robustness is critically under-tested, especially to speech disfluency. Existing evaluations often rely on idealized inputs, overlooking common disfluencies, particularly those associated with conditions like Parkinson's disease. This work investigates whether current Speech-LLMs can maintain performance when interacting with users who have speech impairments. To facilitate this inquiry, we introduce VocalBench-DF, a framework for the systematic evaluation of disfluency across a multi-dimensional taxonomy. Our evaluation of 22 mainstream Speech-LLMs reveals substantial performance degradation, indicating that their real-world readiness is limited. Further analysis identifies phoneme-level processing and long-context modeling as primary bottlenecks responsible for these failures. Strengthening the recognition and reasoning capabilities of individual components and pipelines can substantially improve robustness. These findings highlight the urgent need for new methods to improve disfluency handling and build truly inclusive Speech-LLMs.
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
- Europe > France (0.05)
- North America > Canada (0.04)
- (30 more...)
- Information Technology (0.68)
- Government > Regional Government > North America Government > United States Government (0.46)
- Health & Medicine > Therapeutic Area > Neurology > Parkinson's Disease (0.34)
- Health & Medicine > Therapeutic Area > Musculoskeletal (0.34)
A Good Plan is Hard to Find: Aligning Models with Preferences is Misaligned with What Helps Users
Balepur, Nishant, Shu, Matthew, Sung, Yoo Yeon, Goldfarb-Tarrant, Seraphina, Feng, Shi, Yang, Fumeng, Rudinger, Rachel, Boyd-Graber, Jordan Lee
To assist users in complex tasks, LLMs generate plans: step-by-step instructions towards a goal. While alignment methods aim to ensure LLM plans are helpful, they train (RLHF) or evaluate (ChatbotArena) on what users prefer, assuming this reflects what helps them. We test this with Planorama: an interface where 126 users answer 300 multi-step questions with LLM plans. We get 4388 plan executions and 5584 comparisons to measure plan helpfulness (QA success) and user preferences on plans, and recreate the setup in agents and reward models to see if they simulate or prefer what helps users. We expose: 1) user/model preferences and agent success do not accurately predict which plans help users, so common alignment feedback can misalign with helpfulness; 2) this gap is not due to user-specific preferences, as users are similarly successful when using plans they prefer/disprefer; 3) surface-level cues like brevity and question similarity strongly link to preferences, but such biases fail to predict helpfulness. In all, we argue aligning helpful LLMs needs feedback from real user interactions, not just preferences of what looks helpful, so we discuss the plan NLP researchers can execute to solve this problem.
- Asia > Middle East > Jordan (0.04)
- Asia > Thailand > Bangkok > Bangkok (0.04)
- North America > United States > New Mexico > Bernalillo County > Albuquerque (0.04)
- (12 more...)
- Education (1.00)
- Leisure & Entertainment (0.92)
- Information Technology (0.67)
- Government (0.67)
Trump latest: Migration crackdown, DeepSeek's rise, what's ahead on Tuesday
United States President Donald Trump signed a series of executive orders on Monday aimed at reshaping military policies, including the removal of diversity, equity and inclusion programmes (DEI), reinstating service members discharged for refusing COVID-19 vaccines, and barring transgender people from military service. Earlier in the day, newly confirmed Secretary of Defense Pete Hegseth, who secured the position after a narrow Senate vote, said he would ensure the orders "are complied with rapidly and quickly". Here is the latest news from Monday and a look ahead for the week. Speaking with reporters on board Air Force One on Monday, Trump said that he signed four executive orders. Among those, Trump revealed he signed an order to establish a framework for developing what his administration calls an "American Iron Dome," a missile defence system designed to protect the homeland.
- Asia > India (0.30)
- North America > Mexico (0.15)
- Asia > China (0.07)
- (7 more...)
- Health & Medicine > Therapeutic Area (1.00)
- Government > Regional Government > North America Government > United States Government (1.00)
- Government > Military (1.00)
Trump to declare national emergency at border in flurry of day one orders
In a series of calls with reporters on Monday morning, incoming Trump administration officials outlined dozens of executive orders the president-elect planned to take when he officially takes office, including 10 focused on what one official described as "common sense immigration policy". Officials said that Trump plans to end birthright citizenship, meaning that the children of undocumented migrants living in the US will no longer automatically be considered US citizens. Birthright citizenship, however, is enshrined in the US constitution and would require a two-thirds vote in both chambers of Congress to change. The official provided no further detail on how Trump plans to accomplish this. As part of the national emergency designation at the border, Trump will also direct the Department of Defense to "seal the border" and surge additional resources and personnel, including counter-drone capabilities.
- North America > Mexico (0.39)
- North America > United States > Alaska > Denali Borough > Mt Mckinley (0.07)
- North America > Canada (0.07)
- (2 more...)
Quantifying and Mitigating Unimodal Biases in Multimodal Large Language Models: A Causal Perspective
Chen, Meiqi, Cao, Yixin, Zhang, Yan, Lu, Chaochao
Recent advancements in Large Language Models (LLMs) have facilitated the development of Multimodal LLMs (MLLMs). Despite their impressive capabilities, MLLMs often suffer from an over-reliance on unimodal biases (e.g., language bias and vision bias), leading to incorrect answers in complex multimodal tasks. To investigate this issue, we propose a causal framework to interpret the biases in Visual Question Answering (VQA) problems. Within our framework, we devise a causal graph to elucidate the predictions of MLLMs on VQA problems, and assess the causal effect of biases through an in-depth causal analysis. Motivated by the causal graph, we introduce a novel MORE dataset, consisting of 12,000 VQA instances. This dataset is designed to challenge MLLMs' abilities, necessitating multi-hop reasoning and the surmounting of unimodal biases. Furthermore, we propose two strategies to mitigate unimodal biases and enhance MLLMs' reasoning capabilities, including a Decompose-Verify-Answer (DeVA) framework for limited-access MLLMs and the refinement of open-source MLLMs through fine-tuning. Extensive quantitative and qualitative experiments offer valuable insights for future research. Our project page is at https://opencausalab.github.io/MORE.
- Africa > South Africa (0.04)
- North America > United States > Alaska > Denali Borough > Mt Mckinley (0.04)
- Europe > France (0.04)
- (15 more...)
- Transportation > Ground (0.96)
- Automobiles & Trucks > Manufacturer (0.73)
- Leisure & Entertainment > Sports > Soccer (0.69)
Knowledge Graph Enhanced Large Language Model Editing
Zhang, Mengqi, Ye, Xiaotian, Liu, Qiang, Ren, Pengjie, Wu, Shu, Chen, Zhumin
Large language models (LLMs) are pivotal in advancing natural language processing (NLP) tasks, yet their efficacy is hampered by inaccuracies and outdated knowledge. Model editing emerges as a promising solution to address these challenges. However, existing editing methods struggle to track and incorporate changes in knowledge associated with edits, which limits the generalization ability of post-edit LLMs in processing edited knowledge. To tackle these problems, we propose a novel model editing method that leverages knowledge graphs for enhancing LLM editing, namely GLAME. Specifically, we first utilize a knowledge graph augmentation module to uncover associated knowledge that has changed due to editing, obtaining its internal representations within LLMs. This approach allows knowledge alterations within LLMs to be reflected through an external graph structure. Subsequently, we design a graph-based knowledge edit module to integrate structured knowledge into the model editing. This ensures that the updated parameters reflect not only the modifications of the edited knowledge but also the changes in other associated knowledge resulting from the editing process. Comprehensive experiments conducted on GPT-J and GPT-2 XL demonstrate that GLAME significantly improves the generalization capabilities of post-edit LLMs in employing edited knowledge.
- Europe > Sweden (0.05)
- Africa (0.05)
- North America > United States > Alaska > Denali Borough > Mt Mckinley (0.04)
- (2 more...)
Symbol tuning improves in-context learning in language models
Wei, Jerry, Hou, Le, Lampinen, Andrew, Chen, Xiangning, Huang, Da, Tay, Yi, Chen, Xinyun, Lu, Yifeng, Zhou, Denny, Ma, Tengyu, Le, Quoc V.
We present symbol tuning - finetuning language models on in-context input-label pairs where natural language labels (e.g., "positive/negative sentiment") are replaced with arbitrary symbols (e.g., "foo/bar"). Symbol tuning leverages the intuition that when a model cannot use instructions or natural language labels to figure out a task, it must instead do so by learning the input-label mappings. We experiment with symbol tuning across Flan-PaLM models up to 540B parameters and observe benefits across various settings. First, symbol tuning boosts performance on unseen in-context learning tasks and is much more robust to underspecified prompts, such as those without instructions or without natural language labels. Second, symbol-tuned models are much stronger at algorithmic reasoning tasks, with up to 18.2% better performance on the List Functions benchmark and up to 15.3% better performance on the Simple Turing Concepts benchmark. Finally, symbol-tuned models show large improvements in following flipped-labels presented in-context, meaning that they are more capable of using in-context information to override prior semantic knowledge.
- Europe > United Kingdom (0.27)
- North America > United States > California > San Francisco County > San Francisco (0.14)
- Europe > Switzerland > Zürich > Zürich (0.14)
- (50 more...)
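The label-substitution step at the heart of symbol tuning can be sketched in a few lines. This is a hypothetical illustration of the data transformation only, not the authors' training pipeline; the "foo"/"bar" mapping follows the example given in the abstract.

```python
# Sketch of the symbol-tuning data transformation: natural-language labels
# in in-context input-label pairs are replaced with arbitrary symbols, so a
# model cannot rely on label semantics and must learn the mapping itself.

def symbol_tune_examples(examples, symbol_map):
    """Replace each natural-language label with its assigned arbitrary symbol."""
    return [(text, symbol_map[label]) for text, label in examples]

examples = [
    ("The movie was wonderful.", "positive"),
    ("I hated every minute.", "negative"),
]
symbol_map = {"positive": "foo", "negative": "bar"}

tuned = symbol_tune_examples(examples, symbol_map)
# tuned pairs now carry semantics-free labels, e.g.
# ("The movie was wonderful.", "foo")
```

Finetuning on many tasks transformed this way is what the paper reports as improving robustness to underspecified prompts and flipped labels.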
Generating Data for Symbolic Language with Large Language Models
Ye, Jiacheng, Li, Chengzu, Kong, Lingpeng, Yu, Tao
While large language models (LLMs) bring not only performance but also complexity, recent work has started to turn LLMs into data generators rather than task inferencers, where another affordable task model is trained for efficient deployment and inference. However, such an approach has primarily been applied to natural language tasks and has not yet been explored for symbolic language tasks with complex structured outputs (e.g., semantic parsing and code generation). In this paper, we propose SymGen which utilizes LLMs for generating various annotation-expensive symbolic language data. SymGen consists of an informative prompt to steer generation and an agreement-based verifier to improve data correctness. We conduct extensive experiments on six symbolic language tasks across various settings. Compared with the LLMs, we demonstrate the 1%-sized task model can achieve comparable or better performance, largely cutting inference and deployment costs. We also show that generated data with only a few human demonstrations can be as effective as over 10 times the amount of human-annotated data when training the task model, saving a considerable amount of annotation effort. SymGen sheds new light on data generation for complex tasks, and we release the code at https://github.com/HKUNLP/SymGen.
- North America > United States > Montana (0.05)
- North America > United States > Alabama (0.05)
- Asia > Vietnam (0.04)
- (20 more...)
101 NumPy Exercises for Data Analysis (Python) - Machine Learning Plus
The goal of the numpy exercises is to serve as a reference as well as to get you to apply numpy beyond the basics. The questions are of 4 levels of difficulty, with L1 being the easiest and L4 being the hardest. If you want a quick refresher on numpy, the numpy basics and the advanced numpy tutorials might be what you are looking for. Q. Import numpy as np and print the version number. You must import numpy as np for the rest of the code in this exercise to work.
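A minimal solution to this first exercise, assuming NumPy is installed:

```python
# Import NumPy under the conventional alias and print the installed version.
import numpy as np

print(np.__version__)
```

All later exercises assume the `np` alias from this import.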